Search CORE

9 research outputs found

Intrinsic bias in breast cancer gene expression data sets

Author: A Naderi
AE Teschendorff
AE Teschendorff
AE Teschendorff
AV Ivshina
B Haibe-Kains
C Desmedt
C Desmedt
C Fan
C Sotiriou
C Sotiriou
C Sotiriou
D Dunkler
DJ Slamon
H Dai
HM Bovelstad
HY Chang
JD Mosley
JD Potter
Jonathan D Mosley
JX Yu
L Ein-Dor
L Harris
LD Miller
LJ van't Veer
MJ van de Vijver
P Eden
Ruth A Keri
S Gruvberger
S Michiels
SK Gruvberger
SY Kim
T Sorlie
WL McGuire
Y Wang
Y Yasui
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background While global breast cancer gene expression data sets have considerable commonality in terms of their data content, the populations that they represent and the data collection methods utilized can be quite disparate. We sought to assess the extent and consequence of these systematic differences with respect to identifying clinically significant prognostic groups. Methods We ascertained how effectively unsupervised clustering employing randomly generated sets of genes could segregate tumors into prognostic groups using four well-characterized breast cancer data sets. Results Using a common set of 5,000 randomly generated lists (70 genes/list), the percentages of clusters with significant differences in metastasis latencies (HR p-value < 0.01) was 62%, 15%, 21% and 0% in the NKI2 (Netherlands Cancer Institute), Wang, TRANSBIG and KJX64/KJ125 data sets, respectively. Among ER positive tumors, the percentages were 38%, 11%, 4% and 0%, respectively. Few random lists were predictive among ER negative tumors in any data set. Clustering was associated with ER status and, after globally adjusting for the effects of ER-α gene expression, the percentages were 25%, 33%, 1% and 0%, respectively. The impact of adjusting for ER status depended on the extent of confounding between ER-α gene expression and markers of proliferation. Conclusion It is highly probable to identify a statistically significant association between a given gene list and prognosis in the NKI2 dataset due to its large sample size and the interrelationship between ER-α expression and markers of proliferation. In most respects, the TRANSBIG data set generated similar outcomes as the NKI2 data set, although its smaller sample size led to fewer statistically significant results.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Time to Recurrence and Survival in Serous Ovarian Tumors Predicted from Integrated Genomic Profiles

Author: A Daemen
A Schramm
AC Tan
AP Crijns
Chris Sander
CT Lopes
DM Witten
Douglas A. Levine
E Cerami
E Noetzel
G Heller
H Zou
HK Dressman
HM Bovelstad
J Helleman
J Subramanian
JJ Peluso
JV Rajan
K Yoshihara
KL Borden
L Ein-Dor
LC Hartmann
M GÖnen
M Zangenberg
MW Causey
MY Park
Nikolaus Schultz
O Smaletz
P Pavlidis
Parminder K. Mankoo
PS Freemont
R Shen
R Tibshirani
Ronglai Shen
S Awasthi
S Dell'Orso
S L'Esperance
S Maere
S Mizuarai
S Wada
SF Slovin
Sumitra Deb
SY Yu
T Bonome
T Ota
V Poroyo
Y Jiang
YT Tai
ZZ Wu
Publication venue: Public Library of Science
Publication date: 03/11/2011
Field of study

Serous ovarian cancer (SeOvCa) is an aggressive disease with differential and often inadequate therapeutic outcome after standard treatment. The Cancer Genome Atlas (TCGA) has provided rich molecular and genetic profiles from hundreds of primary surgical samples. These profiles confirm mutations of TP53 in ∼100% of patients and an extraordinarily complex profile of DNA copy number changes with considerable patient-to-patient diversity. This raises the joint challenge of exploiting all new available datasets and reducing their confounding complexity for the purpose of predicting clinical outcomes and identifying disease relevant pathway alterations. We therefore set out to use multi-data type genomic profiles (mRNA, DNA methylation, DNA copy-number alteration and microRNA) available from TCGA to identify prognostic signatures for the prediction of progression-free survival (PFS) and overall survival (OS). prediction algorithm and applied it to two datasets integrated from the four genomic data types. We (1) selected features through cross-validation; (2) generated a prognostic index for patient risk stratification; and (3) directly predicted continuous clinical outcome measures, that is, the time to recurrence and survival time. We used Kaplan-Meier p-values, hazard ratios (HR), and concordance probability estimates (CPE) to assess prediction performance, comparing separate and integrated datasets. Data integration resulted in the best PFS signature (withheld data: p-value = 0.008; HR = 2.83; CPE = 0.72).We provide a prediction tool that inputs genomic profiles of primary surgical samples and generates patient-specific predictions for the time to recurrence and survival, along with outcome risk predictions. Using integrated genomic profiles resulted in information gain for prediction of outcomes. Pathway analysis provided potential insights into functional changes affecting disease progression. The prognostic signatures, if prospectively validated, may be useful for interpreting therapeutic outcomes for clinical trials that aim to improve the therapy for SeOvCa patients

Public Library of Science (PLOS)

Crossref

PubMed Central

Metabolomics-Based Discovery of Diagnostic Biomarkers for Onchocerciasis

Author: A Dabney
A Hoerauf
A Hoerauf
A Hoerauf
A Hoerauf
AK Smilde
AP Plaisier
Ashlee A. K. Nunes
B Crews
BA Boatin
BA Boatin
CA Smith
CA Smith
EW Cupp
EW Cupp
FO Richards Jr
G Dolce
HM Bovelstad
HR Taylor
HR Taylor
Hélène Carabin
J Park
J Saric
JA Swets
JE Bradley
Judith R. Denery
JV Li
K Awadzi
Kim D. Janda
M Gomez Ravetti
M Hall
MA Rodriguez-Perez
Mark S. Hixon
MC Walsh
MG Basanez
MY Osei-Atweneboana
N Vinayavekhin
P Stingl
R Jornsten
S Baek
S Karlsson
S Ritchie
S Specht
TA Lasko
Tobin J. Dickerson
TS Churcher
Y Dadzie
Y Wang
Y Wang
Z Zhang
Publication venue: Public Library of Science
Publication date: 05/10/2010
Field of study

Onchocerciasis, caused by the filarial parasite Onchocerca volvulus, afflicts millions of people, causing such debilitating symptoms as blindness and acute dermatitis. There are no accurate, sensitive means of diagnosing O. volvulus infection. Clinical diagnostics are desperately needed in order to achieve the goals of controlling and eliminating onchocerciasis and neglected tropical diseases in general. In this study, a metabolomics approach is introduced for the discovery of small molecule biomarkers that can be used to diagnose O. volvulus infection. Blood samples from O. volvulus infected and uninfected individuals from different geographic regions were compared using liquid chromatography separation and mass spectrometry identification. Thousands of chromatographic mass features were statistically compared to discover 14 mass features that were significantly different between infected and uninfected individuals. Multivariate statistical analysis and machine learning algorithms demonstrated how these biomarkers could be used to differentiate between infected and uninfected individuals and indicate that the diagnostic may even be sensitive enough to assess the viability of worms. This study suggests a future potential of these biomarkers for use in a field-based onchocerciasis diagnostic and how such an approach could be expanded for the development of diagnostics for other neglected tropical diseases

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Using automated texture features to determine the probability for masking of a tumor on mammography, but not ultrasound

Author: A Gastounioti
A Manduca
EE Fowler
G Ursin
HM Bovelstad
JE Olson
JH Friedman
JJ Heine
JJ Heine
K Kerlikowske
KR Brandt
L Breiman
L Häberle
L Häberle
L Häberle
LF Wessels
M Kallenberg
MJ Pencina
MW Beckmann
P Bühlmann
R Tibshirani
RL Schild
RR Winkel
S Destounis
S Malkov
S Varma
TM Kolb
WA Berg
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Gene Dosage, Expression, and Ontology Analysis Identifies Driver Genes in the Carcinogenesis and Chemoradioresistance of Cervical Cancer

Integrative analysis of gene dosage, expression, and ontology (GO) data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q) associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1) and 13q (FAM48A, MED4) correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers

Crossref

Directory of Open Access Journals

PubMed Central

Proceedings of the 2008 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) Conference

Author: A Churbanov
A Churbanov
A Fujita
A Gyenesei
A Hijikata
A Rawat
A Shipra
AA Ptitsyn
AA Ptitsyn
AA Ptitsyn
AW Schreiber
B Roux
CA Bottoms
CB Giles
D Quest
D Sean
D Wilkins
Dawn Wilkins
ES Chen
G Gamberoni
H Hong
H Liu
H Meng
H Xu
HM Bovelstad
I Fishel
I Medina
James C Fuscoe
Jonathan D Wren
JS Yuan
JS Zielinski
JW Fan
K Thomson
L Guo
L Hertzberg
L Narlikar
L Shi
LK Schnackenberg
LL Elo
M Chae
M Landry
M Mete
M Mete
M Pirooznia
MA Hibbs
MD Dyer
MF Burkart
MG Dozmorov
MG Dozmorov
MK Das
N Mei
ND Mukhopadhyay
O Uzuner
P Li
P Minguez
QH Zhu
R Loganantharaj
RL Frank
RS Wang
S Gao
S Martin
S Sonnenburg
S Winters-Hilt
S Winters-Hilt
S Winters-Hilt
S Winters-Hilt
S Winters-Hilt
S Yuan
SB Montgomery
SM Bridges
Stephen Winters-Hilt
Susan Bridges
T Huan
T Lee
V Kulkarni
V Nagarajan
VI Torvik
WK Lim
WS Sanders
X Chen
Y Ding
Y Gusev
Y Huang
Y Lin
Yuriy Gusev
Z Su
Z Yu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Assessment of reproducibility of cancer survival risk predictions across medical centers

Author: A Bhattacharjee
A Dupuy
A Fernandez-Teijeiro
AA Alizadeh
AC Justice
CM Balch
CM Balch
CM Balch
CM Balch
DG Beer
DR Cox
E Bair
FE Harrell Jr
GJ Gordon
HC Chen
HC Van Houwelingen
HM Bovelstad
HM Bovelstad
Hung-Chia Chen
HY Chen
I Drozdov
J Subramanian
J Subramanian
J Subramanian
James J Chen
JE Korkola
JJ Chen
JJ Smith
JY Cho
K Shedden
M Banerjee
M Radespiel-Troger
M Schemper
M Schemper
MAQC Consortium
MR Segal
MR Segal
MW Kattan
O Decaux
PA Gimotty
PJ Heagerty
PR Greipp
R Newson
R Simon
R Simon
RA Irizarry
RM Simon
RM Simon
S Tomida
SA Waldman
SJ Mandrekar
SK Lau
SL Yu
TA Gerds
TM Habermann
V‘t Veer LJ
X Huang
Z Hu
Z Sun
ZH Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

SurvExpress: An Online Biomarker Validation Tool and Database for Cancer Gene Expression Data Using Survival Analysis

Author: A Elfilali
AH Bild
Alberto Rodriguez-Barrientos
Antonio Martínez-Torteya
AV Ivshina
B Gyorffy
CGA Network
CQ Zhu
D Venet
Emmanuel Martínez-Ledesma
H Mizuno
H Okayama
HM Bovelstad
Hugo Gomez-Rueda
HY Chen
J Budczies
J Hou
J Subramanian
José G. Tamez-Peña
K Shedden
KJ Kao
L Corradi
LM Schriml
M Raponi
M Ringner
P Jezequel
PJ Heagerty
Rafael Chacolla-Huaringa
Raul Aguirre-Gamboa
S Paik
SE Kern
Victor Treviño
William C. S. Cho
Y Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref

Prediction of Ischemic Events on the Basis of Transcriptomic and Genomic Profiling in Patients Undergoing Carotid Endarterectomy

Classic risk factors, including age, smoking, serum cholesterol, diabetes and blood pressure, constitute the basis of present risk prediction models but fail to identify all individuals at risk. The objective of this study was to investigate if genomic and transcriptional patterns improve prediction of ischemic events in patients with established carotid artery disease. Genotype and gene expression profiles were obtained from carotid plaque tissue (n = 126) and peripheral blood mononuclear cells (n = 97) of patients undergoing carotid endarterectomy. Patients were followed for an average of 44 months, and 25 ischemic events occurred (18 ischemic strokes and 7 myocardial infarctions). Blinded leave-one-out cross-validation on Cox regression coefficients was used to assign gene expression–based risk scores to each patient. When compared with classic risk factors, addition of carotid plaque gene expression–based risk score improved the prediction of future ischemic events from an area under the curve (AUC) of 0.66 to an AUC of 0.79. The inclusion of gene expression risk score from peripheral blood mononuclear cells or from 25 established myocardial infarction risk single nucleotide polymorphisms only exhibited marginal effects on the prediction of ischemic events. Prediction of ischemic events is improved by inclusion of gene expression profiling from carotid endarterectomy tissue compared with prediction on the basis of classic risk markers alone in patients with atherosclerosis. The method may be developed to identify subjects at very high risk of ischemic events

Crossref

PubMed Central